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Abstract. Homeobox genes belong to a phylogeneti- 
cally widespread family of regulatory genes that play im¬ 
portant roles in pattern formation and cell-fate specifi¬ 
cation in several model systems (e.g., Drosophila, mouse, 
and C. elegans). Although the evolution of many classes 
of homeobox genes predates the diversification of the Bi- 
lateria, comparatively little is known about homeobox 
genes in outgroups to the Bilatcria, such as the Cnidaria. 
We used the polymerase chain reaction to recover 12 
partial homeoboxes from 2 species of sea anemones, Ale- 
tridinm senile and Nematostella vectensis (phylum Cni¬ 
daria; class Anthozoa). These homeoboxes appear to rep¬ 
resent 9 distinct, mutually paralogous homeobox genes, 
5 of which belong to previously identified cnidarian ho¬ 
meobox classes, and 4 of which appear to represent pre¬ 
viously unidentified classes. The evolutionary relation¬ 
ships between the homeodomains of sea anemones and 
of bilaterian animals were assessed through database 
searches and phylogenetic analyses. As many as 5 of the 
anemone homeoboxes may belong to the Hox class, 
which suggests that the Hox gene complement of cni- 
darians is larger than previously expected. Homologs of 
the even-skipped gene of Drosophila were also identified 
in both Metriditnn and Nematostella. 

Introduction 

Homeobox genes encode a family of transcription fac¬ 
tors that are characterized by the presence of a DNA- 
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binding domain consisting of 60 amino acids and known 
as the homeodomain. Homeobox genes have been found 
in all metazoan phyla that have been surveyed, as well as 
in plants and fungi (reviewed in Biirglin, 1994). Recent 
reports of homeobox genes from basal metazoans, such 
as sponges and cnidarians, are revising our understand¬ 
ing of the evolution of this multigene family (Murtha et 
al., 1991; Schierwater et al., 1991; Miles and Miller, 
1992; Naito et al., 1993; Kruse et al., 1994, Seimiya et 
al., 1994; Degnan et al., 1995; Kuhn et al., 1996). Re¬ 
search on homeobox genes has stimulated an ongoing 
synthesis between the fields of evolutionary biology and 
developmental genetics because of the possibility that 
homeobox genes have played an evolutionary role in the 
diversification of metazoan body plans. 

We identified homeobox-containing genes in two spe¬ 
cies of sea anemone, Metridium senile and Nematostella 
vectensis. Sea anemones belong to the Cnidaria, a phy¬ 
lum of tentacle-bearing, radially symmetric animals that 
possess nematocysts (Hyman, 1940). Substantial phylo¬ 
genetic evidence places the Cnidaria near the base of the 
Eumetazoa, possibly as the sister group to the Bilateria. 
The Cnidaria share with bilaterian animals several de¬ 
rived characters that distinguish eumetazoans from 
sponges, including the possession of epithelio-muscle 
cells and nerve cells. However, the cnidarian body plan 
lacks several derived features of the Bilateria; organ-level 
organization, a well-differentiated mesoderm, a coelom. 
Furthermore, cnidarians display radial or biradial sym¬ 
metry instead of bilateral symmetry. The relative sim¬ 
plicity of the cnidarian body plan suggests that cni¬ 
darians may possess a network of developmental regula¬ 
tory genes that is relatively simple and retains more of 
the primitive characteristics of early eumetazoan ani¬ 
mals than do networks found among the bilaterians. 
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Therefore, study of cnidarians may provide unique in¬ 
sights into the evolution of developmental regulatory 
genes, such as homeobox genes. In any event, the Cnida- 
ria are an outgroup to the Bilateria, so regardless of the 
degree of simplicity in their developmental regulatory 
networks, they will provide an important phylogenetic 
perspective on the evolution of development in the 
Metazoa. 

Among extant Cnidaria, the Anthozoa (sea anemones 
and corals) are thought to be the sister group to a clade 
comprising the Cubozoa, Scyphozoa, and Hydrozoa, 
and it has been argued that the anthozoan body plan 
most resembles that of the ancestral cnidarians (Grass- 
hoff, 1984; Schick, 1991; Bridge etal., 1992; Odorico and 
Miller, 1997). Furthermore, the Anthozoa have a sim¬ 
pler life cycle than other Cnidaria, lacking the presum¬ 
ably derived medusoid stage that characterizes the Scy¬ 
phozoa and the Hydrozoa. We have chosen to study ho¬ 
meobox genes in the Anthozoa that we might better 
understand the ancestral function of these genes in the 
Cnidaria and in the Eumetazoa. 

Two techniques have been used to rapidly survey 
metazoan genomes for the presence of homeoboxes: (1) 
PCR (polymerase chain reaction) amplification of geno¬ 
mic DNA with degenerate primers corresponding to 
conserved regions of the homeodomain, and (2) library 
screening with a conserved oligonucleotide probe. More 
than 20 homeoboxes have been isolated from 8 cni- 
darian species by using these techniques (Murtha ct al., 
1991; Schierwater el al., 1991; Miles and Miller, 1992; 
Schummer et al.. 1992; Naito el al., 1993; Shenk cl al., 
1993; Aerne cl al., 1995). As many as five distinct ho¬ 
meoboxes have been recovered from individual hydro- 
zoans, C/ilorohydra viridissima (Schummer ct al., 1992) 
and Elcutheria dicholoma (Kuhn ct al., 1996). In a re¬ 
cent classification scheme (Naito ct al., 1993) cnidarian 
homeoboxes are sorted into eight mutually paralogous 
classes (Table I), but 18 of 20 of the cnidarian genes are 
from members of the class Hydrozoa. Little is known of 
homeobox genes in other classes, including the corals 
and anemones of the class Anthozoa. We have used PCR 
to survey the genome of two anthozoans for the presence 
of homeobox genes. 

Materials and Methods 

DNA extraction 

Live anemones were obtained from William Zamer at 
Lake Forest College (Lake Forest, IL; Mciridium) and 
Kevin Uhlinger at Bodega Bay Marine Laboratory (Bo¬ 
dega Bay, CA; Nenuitostclla). All animals were starved 
for 2 weeks and rinsed several times in sterile artificial 
seawater prior to DNA extraction. DNA extraction was 
performed with proteinase K and RNase digestion fol¬ 


lowed by several phenol/chloroform extractions (Sam- 
brook cl al., 1989). 

PCR amplification 

Degenerate primers corresponding to highly con¬ 
served regions in helices one and three of the homeodo¬ 
main were used to amplify an 82-bp fragment from 
genomic DNA (nucleotides 60 through 141 of the ho¬ 
meobox, not including the primers). The upstream 
primer (5-GARYTIGARAARGARTT-3') corresponded 
to the amino acid sequence ELEKEF, and the down¬ 
stream primer (5-CKNCKRTTYTGRAACCA-3') cor¬ 
responded to the reverse complement for the amino acid 
sequence WFQNRR. The 5' ends of the primers were 
phosphorylated to facilitate cloning PCR products. Both 
primers were synthesized commercially (Operon Tech¬ 
nologies, Alameda, CA). 

PCR was performed in 50-/d reactions containing 
50 m \1 Tris (pH 8.3), 10 mM KC1, 3.5 mAl MgCL, 
200 pM dNTPs, 0.6 Units Taq polymerase, 12.5 pmol of 
each primer, and 150 ng genomic DNA. Thermal cy¬ 
cling was performed in a MiniCycler (MJ Research, Wa¬ 
tertown, MA) under the following parameters: 5 cycles 
(45-s each) of denaturation at 94°C, 45 s of primer an¬ 
nealing at 37°C, and 45 s of primer extension at 72°C, 
followed by 35 cycles in which the annealing tempera¬ 
ture was raised to 42°C. Eight separate 50-^1 PCR reac¬ 
tions were performed for each species, and these were 
pooled prior to cloning to minimize the effects of sto¬ 
chastic variation known as PCR drift (Wagner et al., 
1994) Twenty-five microliters of the pooled reaction 
products were visualized on a 2% agarose gel stained with 
ethidium bromide, and the expected amplification prod¬ 
uct (114 bp) was observed. Control reactions lacking one 
primer of a primer pair failed to produce the expected 
amplification product. Control reactions lacking tem¬ 
plate produced no visible amplification products. 

Cloning and sequencing 

PCR products were blunted prior to cloning by using 
T4 DNA Polymerase (Costa and Weiner, 1994). The en¬ 
tire reaction was then run out on a 0.8% SeaPlaque GTG 
agarose gel (FMC Bioproducts, Rockland, ME) in a 
modified TAE buffer (0.04 4/ Tris-acetate, 0.2 mAl 
EDTA). The appropriate band was excised from the gel, 
melted, and added directly to a ligation reaction. PCR 
products were cloned into pUCl 8 plasmid that had been 
cut with Smal and treated with phosphatase (Kalvako- 
lanu and Livingston, 1991). Following alkaline denatur¬ 
ation, double-stranded sequencing was performed on re¬ 
combinant clones; the Sequenase v2.0 kit (USB, Cleve¬ 
land, OH) was used with 35 S-labeled dATP and the M13 
forward primer. Alternatively, automated sequencing 
was performed with dye-labeled dideoxynucleotides and 
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Table I 


Previou sly identified homeoboxes from the phylum Cmdaria 


Class 3 

[putative homology] 

Anthozoa h-c 

gene/[method of isolation] 

Hydrozoa bc 

gene/[method of isolation] 

Class 1 


(7W.v/-Hm/[Af 

[HOM/Hox class] 


t7jf/.v/-Ed/[A] 2 

S. lavi-Sa/lA] 1 
cnoxl-H v/[A] 5 
cm/vJ-Hv/jA] 5 

Class II 


iW>.v2-Hv/[A] 5 

[ Delormed of Dm soplula] 


rwn2-Hm/[A] 4 

twuJ-Ed/[A] 2 

(V?m2-Hs/[A] 2 

(7?<u2-Cv/[B] 6 

N.Jo.v2-Sa/[A]' 

Class III 


ivia\3-Cv/[B] 6 

[Oni(lD). Drosophila] 


177UY*TCv/[B] 6 

Class IV 


rwn4-Hm/[A] 4 

[HOM/Hox class] 


aw.xt-C\f[B] b 

Class V 

[Max 1 of mouse] 


imid-Hm/IA] 4 

Class VI 

antpC- Af/[B] 3 

5’.‘t(u7-Sa/[A] 1 

m 


t /ia\7-Pc/[A, B, C] 7 

Class VII 

[even-skipped class] 

eveC- Af/[B] S 


Class VI11 
[w.s/Mike class] 


»i.s7?-Cv/[B] f ’ 


a Naito el al. (1993) divided all previously identified cnidarian homeoboxes into eight classes. Cno.xl-Pc and antpC were not part of the original 
classification scheme but have been assigned to Class VI based on phylogenetic analyses (see Figs. 2 and 3). 

h All cnidarian homeobox sequences identified to date are from the Anlhozoa and the Hydrozoa; the methods of isolation are indicated in brackets: 
[A] PCR, [B] genomic library screening, [C]cDNA library screening. Nohomeoboxes have yet been recovered from the classes Cubozoa or Scyphozoa. 
Two-letter species abbreviations are appended to the name of each gene: Af = Acropora formosa, Cv = Chlorohydra virklissima, Ed = Eleulheria 
dichotoma. Hm Hydra magnipapillala, Us ~ / tydraciinia symbiolongiearpus, Hv = Hydra vulgaris. Pc = Podocorynecornea, Sa - Sarsia. 

c Homeoboxes listed here are from 1 Murtha el al. (1991), 2 Schierwater el al. (1991), * Miles and Miller (1992), 4 Naito el al. (1993), 5 Shenk el 
al. (1993), 6 Schummer el al. (1992), and 7 Aerne el al. (1995). 


an AB1 373A sequencer. Homeobox-containing clones 
were sequenced in the reverse direction with the M13 
reverse primer. Of the 44 clones sequenced for Metri- 
dium, 32 (73%) contained recognizable homeobox se¬ 
quences. Of the 47 clones sequenced for Nematoslella. 
35 (74%) contained recognizable homeobox sequences. 

Results and Discussion 

Homeoboxes identified by lhe PCR-based survey 

Our PCR-based survey identified 12 distinct homeo¬ 
boxes in two species of sea anemone: 7 genes from Me¬ 
tridium senile and 5 from Nematoslella veetensis (Fig. 
1). Consensus sequences for each of these 12 anemone 
homeobox gene fragments were submitted to GenBank 


(accession numbers U42726-U42737). To our knowl¬ 
edge, the 7 homeoboxes identified in Metridium are the 
most recovered from any single species of cnidarian. The 
recovery' of so many distinct genes may be attributable 
to the initial pooling of 8 separate PCR reactions 
(Wagner el al., 1994). The 12 anemone homeoboxes ap¬ 
pear to represent 9 paralogous homeobox genes referred 
to as ant box 1. ant box I a, antbox2, antbox4, anthox5, 
anib-bbxA, antb-bbxB, anth-bbxC, and anth-eve. The 
names were chosen to reflect putative homology between 
anemone homeoboxes and those from other cnidarians 
(see legend to Fig. 1). The PCR surveys identified over¬ 
lapping but different sets of homeoboxes in Metridium 
and Nematoslella. Ant box J, anth-eve, and antb-bbxA 
were recovered from both Metridium and Nematoslella. 
Antbox2, anthox4, antboxS, and antb-bbxB were recov- 
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anthoxl 

Metridiuw 
(5) 



ant hoxla 

Nematostella 

( 11 ) 

anthox2 

Metndiun i 
(5) 

(1) 

(1) 


anthoxl 

Mptridium 

(3) 

(1) 


(2) 

(2) 


Mutridi ll m 
(2) 

(1) 

Nematostella 

(7) 

(9) 

(1) 


antb-bbxA 


Metridi nm 
(1) 

Nematostella 

(5) 


antb-bbxB 


Metndium 

(7) 


antb-bbxC 

glia 

(1) 


10 20 30 40 


CAT TTT AAC CAT TTC CTG ACC AAA GAG CGA CGA TCA GAA 
HFNHFLTKEFRSE 


c . c . . ..t..c...t.a..a... ..ga.. , t . . . 

C CAT TTT ACG AGA TAT TTG ACA AAA GAG CGA CGC ACA GAG 
HFTRYLTKERRTE 


T CAC TAC AAT CGC TAC CTT TGT CGA CCA AGA AGA ATC GAG 
HYNRYLCRPRRIE 


50 60 70 80 

ATG GCC ACA CAG TTA AAC CTT ACA GAT CGA CAA GTA AAG ATT 
MATQLNLTDRQVK I 


cgt t.g.c . .g a a.g .. ..t ..a c 

ATG GCT AGA ATG CTT GAT CTA ACA GAA CGT CAA GTA AAA ATC 
MARMLDLTERQVKI 


ATA GCT CAG TCA TTA GAT CTT ACA GAA AAA CAG GTG AAA ATA 
IAQSLDLTEKQVKI 


.- g. 

.t ... 

.g. 

C CTA TAC TCG AGA TAT CTG ACC AGA ACA AGA AGG CTA GAG CTC GCT AAA TCC CTG GGA CTC TCA GAA AAA CAT CTC AAA ATA 
LYSRYLTRTRRLELAKSLGLSEKHLKI 


G ACC AAG AAT AAC TAT 
T K N N Y 


CTA ACA AGA CTA AGA CGA 
L T R L R R 


TAT GAA ATC GCC ATG 
Y E I A M 


GCG TTA GAT TTG GCT 
A L D L A 


GAA AGA CAG GTA AAA GTT 
E R 0 V K V 


C TTA CGA GAA AAC TAT GTG TCA AGA ACC CGA CGC TGT GAA CTG GCA AAC TCC CTA AGC CTC TCA GAA ACA ACT ATT AAG ATA 
LRENYVSRTRRCELANSLSLSETTIKI 


g 


. a.g a.g ..g ..t.g c.c ... a.g.c . .g.c ... g.g . .c .a.t . .c ..c.c 

a.ga.g ..g ..t.gc.c . .a a.g.c ..g.c ... g.g ..c .a.t ..c ..c. 

. a.g a.g ..g ..t.g c.c a a.g.c ..g.c . .t g.g . .c a.t . c ..c. 

T GGC GAA GAG AAA TAT CTG ACG GAA GCA AAA CGC GCA GAA CTC TCC AAA GAC CTG GGA ATG ACG GAA ACC CAA GTG AAA ACA 
GEEKYLTEAKRAELSKDLGMTETQVKT 


c aaa.ct.t.c.c..g..t.g.c..c.a .g . t . . g . . t 

C GAA CGA CAG CAA TAC ATG GTC GGG GCC GAG AGA CAT TAC CTC GCA GCG TCG CTA AAC CTT ACA GAA ACG CAA GTC AAA GTT 
ERQQYMVGAERHYLAASLNLTETQVKV 


T GAG GCG AAG AAG TAT CTG ACA GCC ACG GAA CGA AGT GAT ATG GCT TCA CTT TTG AAC GTT ACA GAA ACA CAA GTA AAA ATA 
EAKKYLTATERSDMASLLNVTETQVK I 


Figure I. Nucleotide alignments of homeobox-coniaimng clones representing 12 distinct homeobox 
genes, 7 from Metridiuni senile and 5 from Xenuilostelhi veetensis. The nucleotide alignment spans posi¬ 
tions 61-142 of the 180-bp homeobox (amino acids 21-47 of the homeodomam). Nucleotide alignments 
and phylogenetic analysis suggest that these 12 homeohox fragments represent at least 9 distinct homeohox 
classes or cognate groups (in boldface type). The term anihox is used to designate anemone homeoboxes 
with putative homology to Hox class homeoboxes ( anth ozoan Hox homeobox). The term anth-hhx desig¬ 
nates anemone homeoboxes lacking similarity to the Hox class. The term anth-eve indicates the apparent 
homology of this anemone homeobox fragment to the even-skipped homeobox of Drosophila The se¬ 
quences of individual clones (lowercase) are aligned to a consensus sequence (uppercase). For anthoxl, 
anthoxl . antho\4, anthoxS, anih.hhx.i, anth.hhxB, and anth-eve, the consensus sequence shown is from 
Metndium. For anth.hbxC and anthoxl a the consensus sequence shown is from A 'enialoste/la The in¬ 
ferred amino acid sequences are indicated below the consensus. The number of clones possessing a given 
sequence is indicated in parentheses. Identity with the consensus sequence is indicated with a period. 


ered only in Metndium. Anthoxl a and anth-hbxC were 
recovered only in Nematostella. 

Comparison of homeoboxes from anemones and other 
cnidarians 

We wanted to know if the homeoboxes we isolated are 
orthologous to previously identified homeoboxes from 


cnidarians (Table 1; Naito et ai, 1993), or if they repre¬ 
sent new classes of cnidarian homeoboxes. We used phy¬ 
logenetic methods to evaluate the evolutionary relation¬ 
ships between representatives of the 8 putative cnidarian 
classes and the 12 homeoboxes recovered from Metri- 
clium and Nematostella (Saitou and Nei, 1987; Fig. 2). A 
neighbor-joining tree (Saitou and Nei, 1987) was con¬ 
structed using the program Phylip (version 3.5, 
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Figure 2. Phylogenetic relationships among cnidarian homeodomains were inferred using the neighbor¬ 
joining algorithm (Saitou and Nei, 1987) of PHY LIP (Phylogenetic Inference Package, version 3.5; Felsenstein, 
1989). The data consisted of predicted amino acid sequences (positions 21 -47; see Fig. 1) from the 12 anemone 
homeodomains and 13 additional cnidarian homeodomains. Anemone homeodomains are indicated in bold¬ 
face type. The cnidarian sequences were chosen to represent all 8 of the cnidarian homeobox classes proposed 
by Naito cl al. (1993). Boxes enclose representative sequences from each class (roman numerals; see Table I). 
Sequences not enclosed in boxes (cnoxl Pc and antpC) were not included in the classification scheme of Naito 
et al. (1993). The tree show n is unrooted. Distances between homeodomains were calculated using the Protdist 
program of PHYLIP and the PAM-Davhoff amino acid substitution matrix (Dayhoff, 1978). Branch lengths 
in the horizontal direction are proportional to the number of expected amino acid substitutions that have 
occurred per site (scale at lower right). Branch lengths in the vertical direction are not significant. Circled 
numbers along branches indicate the percentage of trials in which a given partition between genes was found in 
2000 replications of the bootstrap resampling procedure (Felsenstein, 1985). Only partitions supported in >40% 
of bootstrap replicates are indicated. 
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Figure 3. Phylogenetic relationships among cnidarian homeodomains were inferred by parsimony. The 
tree depicted is the strict consensus of 27 equally parsimonious trees identified in a heuristic search with the 
computer program PA UP (Phylogenetic Analysis Using Parsimony, version 3.1, SwolTord, 1991). Tree-bisec¬ 
tion and reconnection branch swapping (TBR) was performed on 20 random starting trees. The tree shown is 
unrooted. The data consisted of predicted amino acid sequences (positions 21-47; see Fig. 1) from the 12 
anemone homeodomains and 13 additional cnidarian homeodomains representing 8 distinct classes (Naito el 
al., 1993; Table I). Anemone homeodomains are indicated in boldface type. Boxes enclose representatives from 
each class (roman numerals). Sequences not enclosed in boxes (cnoxJ.Pc and antpC) were not included in the 
classification scheme of Naito el al. (1993). Tree length is 162 steps. Consistency index (Cl) = 0,691. Retention 
index (Rl) = 0.695. Branch lengths in the horizontal direction are proportional to the number of amino acid 
substitutions that have occurred along each branch. The number of amino acid substitutions is specified above 
each branch. Branch lengths in the vertical direction are not significant. Circled numbers indicate the percent¬ 
age of trials in which a given partition between genes was found in 5C)0 replications of the bootstrap resampling 
procedure (Felsenstein, 1985). Only partitions supported in >50% of bootstrap replicates arc indicated. 
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anthoxl . Ms 

1. CnHv3 ( Hydra vulgaris) [M62872] 

2. Abd-A homolog ( Aedes aegypti) [P29552] 

3. Hox D3 (Homo sapiens) [S15548] 

4. CnHv2 ( Hydra vulgaris) [M62871] 

5. Hox 3 (Branchiostoma floridae) [D44629] 


HFNHFLTKERRSEMATQLNLTDRQVKI 

?.K_A. L. KH. . . SE . . I. ? 

....Y..RR..I.I.HA.C..E..I.. 
. . .RY.CRP. -V. .-NL... .E. .1. . 

?.K. . . .T.LSKK. . .SE. . I.? 

...RY.CRP..V...AM....E..1.. 


anthoxl.Nv 

1. CnHv2 (Hydra vulgaris) [M62871] 

2. CnHv3 (Hydra vulgaris) [M62872] 

3. Abd-A homolog (Aedes aegypti) [P29552] 

4. Hox D3 (Homo sapiens) [S15548] 

5. Hox 3 (Branchiostoma floridae) [D44629] 


HFNHFLTKERRSEMRSQLNLTERQVKI 

?.K....T.LSKK...S...1.? 

?.K. . . .A.LAKH. . -S. . .1.? 

. . . .Y. - RR. .I.IAHA.C.I - - 

. . .RY.CRP - .V. .ANL.I. . 

. . .RY.CRP. .V. .AAM.I. . 


anthoxla.Nv 

1. Hox A3 (Mus musculus) [P02831] 

2. Hox D3 (Homo sapiens) [S15548] 

3. Hox 3 (Branchiostoma floridae) [D44629] 

4. Hox B3 (Homo sapiens) [D37042] 

5. Hox B3 {Xenopus laevis) [P31247] 

anthox2.Ms 

1. Homeobox (Carassius auratus) [L09690] 

2. Hox A2 (Notophthalmus viridescens) [P31261] 

3. Hox B2 (Homo sapiens) [E37042] 

4. Hox B2 (Mus musculus) [A35511] 

5. Hox 2 (Saccoglossus kowalevskii) [C44636] 
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. .S 

. . 1 .. 
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. F . K . 






. F . K . 


.V. . 

. .AL. 


.R. . .V 

. F . K . 



. .AL. 


.R. . .V 
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. .SM. 

. .s 
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anthox4.Ms 

1. Hox C4 (Eattus norvegicus) [P18865] 

2. Hox 8 (Branchistoma floridae) [144629] 

3. Hox 5 (Saccoglossus kowalevskii) [F44636] 

4. Hox 4 ( Petromyzon marinus) [H48200] 

5. HB3 (Tripneustes gratilla) [P10178] 


LYSRYLTRTRRLELAKSLGLSEKHLKI 
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4. btn gene product (Drosophila) [S75228] CYS.VA.E.T.... 

5. mox-1 (Mus musculus) [P32442] AHH.VN...S.... 


anth-eve.Ms 

1. eveC (Acropora formosa ) [S36770] 

2. Xhox-3 (Xenopus laevis) [A60092] 

3. eve (Drosophila melanogaster) [X05138] 

4. EVX2 (Homo sapiens) [M59983] 

5. EVXl (Homo sapiens) [S22586] 
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Felsenstein, 1989; Fig. 2), and a parsimony tree was con¬ 
structed using the program PA UP (Phylogenetic Analysis 
Using Parsimony, version 3.1; S wo fiord, 1991; Fig. 3). 

On the gene phylogenies (Figs. 2 and 3), Class I, Class 
II, and Class IV, as defined by Naito el a/. (1993; Table I) 
are monophyletic. Among the anemone homeodo- 
mains, anthoxl of Metrulium seems to be orthologous 
to anthoxl of Nenuitostella. Anth-eve of Metridiuni and 
Nematostella also appear orthologous, as do anth-hbxA 
of Metndium and Nematostella. The phylogenies also 
permit a tentative assignment of some of the anemone 
homeodomains to specific cnidarian classes. Anth-eve 
appears orthologous to eveCond (Class VI1). Anthoxl 
and anthoxl a cluster with members of cnidarian Class 1 
that are believed to be related to the Antentuipedia- like, 
or Hox class, homeoboxes (Naito el al., 1993). Anthox5 
appears orthologous to enox5 (Class V). Other associa¬ 
tions appear more tenuous. 

Identity of anemone homeoboxes assessed by searching 
the Genbank database 

The sequences of anemone homeodomains inferred 
from the gene fragments recovered in this study were 
compared to sequences in GenBank with the BLASTx 
network search algorithm (Basic Local Alignment 
Search Tool; Altschul et al., 1990). The top five matches 
recovered in the BLAST searches are aligned to their 
anemone homeodomain counterparts in Figure 4. For 
each of the unique anemone homeodomain fragments, 
the search was limited to the 50 most similar genes in 
the GenBank database. In every case, all 50 of the genes 
identified were homeobox genes. At the amino acid level, 
anemone homeodomains are 59%-85% identical to the 
most similar homeobox genes in the Gen Bank database. 
The BLAST searches reveal that five of the anemone se¬ 
quences are most similar to homeodomains of the Hox 
class; these are anthoxl, anthoxl a. anthoxl, anlhoxA, 
and ant hox 5. The anth-eve homeodomains are most 
similar to even-skipped homologs from other species, 
particularly the even-skipped homolog of the staghorn 
coral (Miles and Miller, 1992). The anth-hbxA homeo¬ 
domains from both Metridinm and Nematostella appear 
most similar to the flatworm homeodomain Egllbxl 
(Oliver el al., 1992). Egllbxl has been referred to as an 
NK type homeobox, but this assignment is not supported 
by a recent homeobox gene tree (Biirglin, 1994). The 
anth-hbxA homeodomains also resemble the engrailed 


homeodomain of Drosophila. Anth-hbxB of Metridinm 
is most similar to empty-spiracles homologs from the 
mouse (EMX1 and EMX2). Anlh-hbxC from Nemato¬ 
stella exhibits similarity to a diverse group of genes, in¬ 
cluding Hox A8 from Branchiostoma and msxl from 
mouse, two members of the msh- class of homeodo¬ 
mains. 

Phylogenetic relationships of homeoboxes from sea 
anemones and higher metazoans 

Phylogenetic analyses of enidarians and bilaterians 
were used to further assess the identity of the anemone 
homeoboxes (Saitou and Nei, 1987). In addition to the 
12 anemone homeodomains and 13 sequences from 
other enidarians, our analysis included 20 sequences 
from bilaterian animals, each representing a different 
class of homeodomains (Biirglin, 1994). A neighbor¬ 
joining tree (Fig. 5) supports the finding that some of the 
anemone homeoboxes belong to previously proposed 
cnidarian classes (Naito et al., 1993; see previous section 
Comparison of homeoboxes from anemones and other 
enidarians). Anthoxl allies with members of Class I 
(enoxl.ilm and SAox3 ), anthoxl with members of Class 
II (enoxl.Cv and Xlo.v2), anthoxA with members of 
Class IV (enoxA.il m and enoxl.Cv), ant hox 5 with Class 
V (aiox5.1Im), and anth-eve with Class VII (eveC). All 
of these findings, w ith the exception of the precise rela¬ 
tionships of anthoxA, are supported bv a parsimony anal¬ 
ysis (Fig. 6). 

The phylogenetic analyses suggest that certain anem¬ 
one homeoboxes represent classes previously unidenti¬ 
fied in the Cnidaria (Figs. 5 and 6). Anthoxl a from Nem¬ 
atostella appears as the sister group to the Class I genes 
and may represent a new class of cnidarian homeoboxes 
(Class la). Anth-hbxA, found in both Metridinm and 
Nematostella. appears most closely related to the homeo¬ 
box from the human gene TCL-3, a non-cluster member 
of the Hexapeptide superclass (Biirglin, 1994). As sug¬ 
gested by the BLASTx search results (Fig. 3), Anth-hbxB 
from Metridinm appears most closely related to the Dro¬ 
sophila empty-spiracles homeobox, a gene known to be 
involved in head development in both Drosophila and 
vertebrates (Walldorf and Gehring, 1992; Simeone et al., 
1992). Finally, Anth-hbxC appears most closely related 
to the C. elegans homeobox ceh-9. The PCR surveys of 
sea anemones failed to recover representatives of cni¬ 
darian classes 3,4, 6, or 8. The BLAST searches and phy- 


Figure 4. Alignments of anemone homeodomain fragments lo similar homeodomains identified by 
searching the GenBank molecular database with the BLASTx algorithm (Basic Local Alignment Search 
Tool; Altschul et at.. 1990). The predicted amino acid sequences of anemone homeodomains (amino acids 
21 -47) are aligned to the five best matches identified by the database search in order of decreasing similarity 
lo the query sequence. Periods indicate identity with the anemone homeodomain. Accession numbers of 
GenBank sequences are presented in brackets. 
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Figure 5. Phylogenetic relationships of predicted homeodomain sequences from cnidarians and bilateriai s 
were inferred using the neighbor-joining algorithm (Saitou and Nei, 1987). The tree was generated as described 
in Figure 2. Cnidarian homeodomain sequences are labeled as in Figures 2 and 3, except that brackets are used 
to identify representative sequences from each cnidarian class (Naito et ai. 1993; roman numerals). Non- 
cnidarian homeodomain sequences were chosen to represent each of the 20 proposed classes of homeobox 
genes that make up an existing classification scheme (Biirglin, 1994). Non-cnidarian sequences are from Dro¬ 
sophila [D], C . elegant [C], and Homo sapiens [H]. Branch lengths in the horizontal direction are proportional 
to the number of expected amino acid substitutions that have occurred per site (scale at lower right). Branch 
lengths in the vertical direction are not significant. Circled numbers along branches indicate the percentage of 
trials in which a given partition between genes was found in 2000 replications of the bootstrap resampling 
procedure (Felsenstein, 1985). Only partitions supported in >40% of bootstrap replicates are indicated. Dashed 
lines indicate homeoboxes of the Hox class from Drosophila. These genes have differing anterior borders of 
expression along the anterior-posterior axis of the developing embryo. Labial is expressed more anteriorly than 
Antennapedia (Antp), which in tum is expressed more anteriorly than AbdominalB (AbdB). For this reason, 
Hox genes are sometimes classified according to their expression pattern in the embry o as anterior (e.g., labial ), 
central (eg- Antennapedia ), and posterior (e.g., AbdominalB). 
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Figure 6. Phylogenetic relationships of predicted homeodomain sequences from cnidarians and bilat- 
erians were inferred using the maximum parsimony criterion. The tree was generated as in Figure 3. Se¬ 
quences used are identical to those in Figure 5. Anemone homeodomains are indicated in boldface type. 
The tree depicted is a 50% majority-rule consensus of 123 equally parsimonious trees identified by tree- 
bisection and reconnection (TBR) branch swapping on 20 random starting trees. Tree length = 354 steps. 
Consistency index (Cl) = 0.506. Retention index (Rl) = 0.557. Branch lengths in the horizontal direction 
are proportional to the number of amino acid substitutions that have occurred along each branch. The 
number of amino acid substitutions is specified above each branch. Branch lengths in the vertical direction 
are not significant. Circled numbers indicate the percentage of trials in which a given partition between 
genes was found in 200 replications of the bootstrap resampling procedure (Felsen stein. 1985). Only parti¬ 
tions supported in >50% of bootstrap replicates are indicated. 
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* Cnox2 from Eleutlieria dichotoma. Hydra magnipapiUata. Hydra vulgaris, and Ilydractinia symbiolongicarpus was not included in the phylogenetic analyses because the homeodomains 
lave predicted amino acid sequence identical to that of cnox2 from Chlorohydra viridissima 
** The neighbor-joining tree (Fig. 5) supports inclusion of anthox4 in class IV, whereas the parsimony tree (Fig. 6) suggests that anthox4 represents a novel class. 
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Figure 7. Comparison of parsimony analyses based on nucleotide sequences and amino acid sequences. 
(A) Kuhn cl a/. (1996) inferred the phylogenetic relationships ofcnidarian hox gene fragments (aia\I-cno.\5 
or Eleitificria ilichotoma) to HOM/Hox homeoboxes from humans and Drosophila using parsimony. The 
tree depicted is a 50% majority-rule consensus of eight equally parsimonious trees found in a heuristic search 
based on unweighted nucleotide sites (180 base pairs): branch swapping by subtree-pruning-regrafting was 
performed on 100 random starting trees. (B) A reanalysis based on a parsimony analysis of unweighted amino 
acid sites (60 sites) using identical methods to search for the most parsimonious tree. 
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evolutionarily conserved genomic clusters. Further¬ 
more, individual Hox genes are expressed along the an¬ 
terior-posterior axis of the developing embryo in the 
same relative order as their position in the Hox cluster: 
genes located towards the 3' end of the cluster are ex¬ 
pressed more anteriorly than genes located towards the 
5' end of the cluster. This evidence suggests that a linked 
cluster of Hox genes is a shared, derived metazoan char¬ 
acter that is involved in patterning the anterior-posterior 
axis (Slack el al., 1993; Miller and Miles, 1993). How¬ 
ever, very little is known about the composition of the 
Hox cluster in basal metazoans such as the Cnidaria, or 
even whether such a cluster exists. 

As our knowledge of Hox genes in basal metazoans 
increases, we should be able to establish the antiquity of 
the Hox cluster and of individual Hox genes, and to 
make inferences about the primitive function of Hox 
genes. Among the relevant questions are (1) How many 
Hox genes are found in the Cnidaria? and (2) What are 
the relationships of cnidarian Hox genes to Hox genes 
from bilaterian animals? Phylogenetic analyses of verte¬ 
brate and insect Hox genes suggest a very early trichot¬ 
omy in the evolution of the Hox class—giving rise to an 
Ant ennapedia/deformed precursor, a labial/probosci- 
pedia precursor, and an AbdominalB precursor (Schu¬ 
bert el al., 1993). Previous studies revealed similarities 
between individual cnidarian homeoboxes and homeo- 
boxes from the Drosophila genes labial, proboscipedia. 
Deformed, and Antennapedia (Murtha el al., 1991; 
Schierwater el al., 1991; Schummer^Y al., 1992; Naito c/ 
al., 1993). These homeobox genes have been referred to 
as anterior and central members of the Drosophila 
HOM/Hox cluster because their anterior borders of ex¬ 
pression lie in the anterior half of the Drosophila embryo 
(Bartels el al., 1993). A homolog of the posterior Hox 
genes {AbdominalB- like) has not been recognized in the 
Cnidaria. 

With few exceptions {e.g., Naito el al., 1993), the iden¬ 
tities of cnidarian Hox genes have been considered in iso¬ 
lation: Hox class genes cloned from a single species are 
compared to Hox sequences from Drosophila and verte¬ 
brates without reference to Hox genes from other cni- 
darians or to non-Hox homeoboxes; these comparisons 
have been made using pairwise alignments {e.g., Schum- 
mer el al., 1992; Miles and Miller, 1992; Miller and 
Miles, 1993) or, more rarely, phylogenetic techniques 
(Aerne el al, 1995; Kuhn el al., 1996). In this study we 
have attempted a more systematic approach: we used 
phylogenetic techniques to simultaneously assess the re¬ 
lationships between all the reported cnidarian homeo¬ 
boxes and a broad representation of homeoboxes from 
bilaterian animals. As in several previous studies, our 
phylogenetic analyses suggest that certain cnidarian Hox 
genes are closely related to anterior and central members 
of the Hox cluster in higher metazoans (Murtha el al.. 


1991; Schierwater el al., 1991; Schummcr el al, 1992; 
Naito ei al., 1993). The members of cnidarian classes II, 
IV, V, and VI appear to be more closely related to the 
labial and Antennapedia homeoboxes of Drosophila 
than to the AbdominalB homeobox or any non-Hox ho- 
mcobox. Our analyses indicate that cnidarians may pos¬ 
sess homologs to posterior members of the complex as 
well as to the more anterior genes. The phylogenetic 
analyses (Figs. 5 and 6) suggest that members of the cni¬ 
darian homeobox classes I and la are most closely related 
to the Drosophila Abdominal-B homeobox, the most 
posterior member of the Drosophila Hox cluster. There¬ 
fore, distinct anterior/central {Antennapedia/labial- like) 
and posterior {AbdominalB-Wke) Hox gene lineages may 
have predated the split between the Cnidaria and the Bi- 
latcria (Schubert el al, 1993). Additional data on cni¬ 
darian Hox genes, such as their genomic organization 
and axial expression, may serve to corroborate or contra¬ 
dict this hypothesis. 

A recent paper by Kuhn el al. (1996) suggested a very 
different scenario for the origin of the HOM/Hox genes. 
Hox genes from triploblasts {Drosophila and humans) 
were compared with Hox genes from a cnidarian (the hy- 
drozoan Eleutheria dichotonui) by using parsimony and 
distance methods. In the phylogenetic analyses, the re¬ 
sulting topologies can be rooted such that genes from 
triploblasts form a monophyletic group to the exclusion 
of all cnidarian Hox-type genes, suggesting that the com¬ 
mon ancestor of triploblasts and cnidarians possessed 
only a single Hox-type gene. In other words, Kuhn el al. 

(1996) find no evidence of a Hox cluster in the common 
ancestor of diploblasts and triploblasts, whereas the anal¬ 
yses presented here are consistent with the existence of 
an ancestral cluster. 

The difference in the two phylogenetic conclusions de¬ 
rives from the choice of phylogenetic characters—the 
analyses reported here were based on the inferred amino 
acid residues within the homeodomain; those of Kuhn el 
al. (1996) were based on the entire nucleotide sequence 
of the homeobox, including rapidly evolving, silent nu¬ 
cleotide sites within codons. But the relevant evolution¬ 
ary time scale is very long: both cnidarian and bilaterian 
fossils are known from Cambrian sediments laid down 
more than 500 million years ago {e.g., Sepkoski, 1981), 
so more than 1 billion years of independent evolution 
separate the last common ancestor of modern-day 
triploblasts and cnidarians. Therefore, rapidly evolving 
sites are likely to be saturated with phylogenetic noise. 
Furthermore, if there are differences in codon bias be¬ 
tween taxa, the inclusion of silent sites in the analysis 
may falsely indicate phylogenetic affinity between non- 
homologousgene sequences derived from closely related 
taxa. When the phylogenetic analysis of Kuhn el al., 
(1996) is repeated with the inferred amino acid substitu- 
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tions as phylogenetic characters, the result is consistent 
with the existence of an ancestral cluster (Fig. 7). 

Confidence in inferring orihology ofhomeodomain 
sequences 

In trying to infer the evolutionary relationships of ho- 
meobox genes, we are confronted by historical limita¬ 
tions. Our inferences are based on a small region of se¬ 
quence that is highly conserved across very distantly 
related phyla. Sequence conservation outside the ho- 
meodomain is generally poor, so sequence alignment is 
difficult. The phylogenetic utility of sequences outside 
the homeodomain may therefore be limited. Consider¬ 
ing this historical limitation, our inferences about the 
identity of several of the cnidarian homeodomains ap¬ 
pear surprisingly robust. Several partitions on the neigh¬ 
bor-joining tree that suggest homology between cni¬ 
darian and bilaterian homeodomains are supported by 
more than 50% of bootstrap trials (for example, cni¬ 
darian even-skipped homologs 4- Drosophila eve: boot¬ 
strap proportion > 75%). Results from computer simu¬ 
lations and experimental bacteriophage phylogeniessug¬ 
gest that bootstrap proportions over 50% can indicate a 
much higher than 50% probability that the correspond¬ 
ing branch is correct (Hillis and Bull, 1993; Felsenstein 
and Kishino. 1993). The results presented here agree 
with an emerging pattern: newly discovered homeoboxes 
can often be assigned to previously described classes with 
reasonable confidence, though the interrelationships be¬ 
tween different classes of homeobox genes are difficult 
to reconstruct (e.g., Finnerty et al. , 1996). This pattern 
suggests that many of the homeobox classes are ex¬ 
tremely ancient, and that the sequences of these genes 
have been very strongly conserved. 
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